Communities and bottlenecks: Trees and treelike networks have high modularity 
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Much effort has gone into understanding the modular nature of complex networks. Communities, also known 
as clusters or modules, are typically considered to be densely interconnected groups of nodes that are only 
sparsely connected to other groups in the network. Discovering high quality communities is a difficult and im- 
portant problem in a number of areas. The most popular approach is the objective function known as modularity, 
used both to discover communities and to measure their strength. To understand the modular structure of net- 
works it is then crucial to know how such functions evaluate different topologies, what features they account for, 
and what implicit assumptions they may make. We show that trees and treelike networks can have unexpectedly 
and often arbitrarily high values of modularity. This is surprising since trees are maximally sparse connected 
graphs and are not typically considered to possess modular structure, yet the nonlocal null model used by mod- 
ularity assigns low probabilities, and thus high significance, to the densities of these sparse tree communities. 
We further study the practical performance of popular methods on model trees and on a genealogical data set 
and find that the discovered communities also have very high modularity, often approaching its maximum value. 
Statistical tests reveal the communities in trees to be significant, in contrast with known results for partitions of 
sparse, random graphs. 
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I. INTRODUCTION 

Complex networks have made an enormous impact on re- 
search in a number of disciplines Networks have rev- 
olutionized the study of social dynamics and human con- 
tact patterns JSHH), metabolic and protein interaction in a 
cell 121 [10), ecological food webs lfm - [T3l . and technologi- 
cal systems such as the World Wide Web |[T4l [T5l and air- 
line transportation networks lfT6l[T7l . Seminal results include 
the small-world [18] and scale-free nature lfT4ll of many real- 
world systems. 

One of the most important areas of network research has 
been the study of community structure 1 1 9 , 20 1 . Communities, 
sometimes called modules, clusters, or groups, are typically 
considered to be subsets of nodes that are densely connected 
among themselves while being sparsely connected to the rest 
of the network. Networks containing such groups are said 
to possess modular structure. Understanding this structure is 
crucial for a number of applications from link prediction CD 
and the flow of information 1 22 1 to a better understanding of 
population geography |[23T[25l . 

Much effort has been focused on finding the best possi- 
ble partitioning of a network into communities. Typically, 
this is done by optimizing an objective function that mea- 
sures the community structure of a given partition. Many al- 
gorithmic approaches have been devised. Most partition the 
entire network while some focus on local discovery of individ- 
ual groups [26-28 1 . Overlapping community methods, where 
nodes may belong to more than one group, have recently at- 
tracted much interest 129H3TI . For a lengthy review of com- 
munity methods see |[T9l . 
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Given the reliance on objective functions, it is important 
to understand how the intuitive notion of communities as in- 
ternally dense, externally sparse groups is encoded in the ob- 
jective function. Some functions simply measure the density 
of links within each community, ignoring the topological fea- 
tures those links may display, while other functions rely upon 
those links forming many loops or triangles, for example. We 
show the importance of understanding these distinctions by 
revealing some surprising features of how communities are 
evaluated. In particular we show that the only requirement 
for strong communities, according to the most popular com- 
munity measure, is a lack of external connections, that bottle- 
necks [32] leading to isolated groups can make strong com- 
munities even when those groups are internally maximally 
sparse. This contradicts the notion of communities as being 
unusually densely interconnected groups of nodes. 



This paper is organized as follows. In Sec. [TT] we present 
several measures of community quality and discuss their dif- 



ferent features and purposes. In Sec. Ill we show analytically 



that trees and treelike graphs can possess partitions that dis- 
play very high, often arbitrarily high values of modularity. 



This is our primary result. In Sec. IV we apply two success- 
ful community discovery algorithms to these trees and show 
that the discovered communities can have even higher modu- 
larities. We also study the community structure of a treelike 
network derived from genealogical data. In Sec. [V] we per- 
form statistical tests on the various communities and find that 
most of the partitions we consider for trees are statistically 
significant. We finish with a discussion and conclusions in 
Sec. EI] 
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II. MEASURING COMMUNITIES 

Given a network, represented by a graph G of N nodes and 
M links whose structure is encoded in an NxN adjacency ma- 
trix A, where Ajj — 1 if nodes i and j are connected and zero 
otherwise, we wish to determine to what extent G possesses 
modular structure. To put the notion of a community or mod- 
ule onto a firm foundation, objective functions have been in- 
troduced to quantify how "good" or "strong" a community or 
a partitioning into communities is. These objective functions 
are also often the goal of an optimization algorithm, where 
the algorithm attempts to find the community or communities 
that maximizes (or minimizes) the objective function. Here 
we briefly discuss three objective functions: subgraph conduc- 
tance, modularity, and partition density. Due to its popularity 
and wide use we will focus primarily on modularity. 

A. Conductance 



significance of the subgraph. It may be possible for a random- 
ized null graph to contain subgraphs exhibiting comparable 
conductance, for example, and conductance alone does not 
capture this. Modularity If34l [35l was introduced to account 
for this in an elegant way. It has become the most common 
community objective function |[T9ll20l and possesses a num- 
ber of distinct advantages over previous approaches, such as 
not requiring the number of communities to be known in ad- 
vance. However, it has some drawbacks as well. It is known 
to possess a resolution limit where it prefers communities of a 
certain size that depends only on the global size of the network 
and not on the intrinsic quality of those communities fl36ll37l . 
Meanwhile, sparse, uncorrelated random graphs are expected 
not to possess modular structure, but fluctuations may lead 
to partitions with high modularity [38-40]. Yet another con- 
cern is modularity's highly degenerate energy landscape ATI , 
which may lead to very different yet equally high modularity 
partitions. 

Modularity Q can be written as 



The conductance cf> of a subgraph is a measure of how 
'isolated' the subgraph is, in analogy with electrical conduc- 
tance ll33l . Subgraphs with many connections to the rest of 
the network will have high conductance, whereas a subgraph 
will have low conductance if it relies on a few links for exter- 
nal connectivity. For a given subgraph S such that \S\ < N, 
one form of conductance is 

ZyA;;[j € S][j <t S] K s - 2m s 



0(5) = 
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where [P] = 1 if proposition P is true and zero otherwise, 
Ks = TiijAij[i e S] is the sum of the degrees (number of 
neighbors) of all nodes in S , and ms is the total number of 
links in S . (The factor of 2 in the denominator is sometimes 
dropped.) In other words, subgraph conductance is the ratio 
between the number of links exiting the subgraph to the num- 
ber of links within the subgraph. 

While low <p may appear to be a good indicator of commu- 
nity structure, we remark that it primarily measures isolation 
or "bottleneckedness," meaning that, e.g., a random walker 
moving in a subgraph with low conductance will have very 
few opportunities to exit the subgraph, whereas it would have 
many opportunities if the subgraph had high conductance. 
This is also true if the subgraph is a densely interconnected 
module. However, consider a large two-dimensional (2D) pe- 
riodic square lattice of size L x x L v , L x > L y . This graph has 
N = L x L y nodes and M = 2N links and is generally con- 
sidered to have no modular structure. The conductance of a 
subgraph created by cutting the lattice in half along the y di- 
rection is (p — 2L y / \L x Ly) = 2/L x . As the lattice grows, the 
conductance of this subgraph decreases, despite there being 
no modular structure. 



B. Modularity 

A key point lacking in earlier definitions of communities 
such as conductance is that they fail to quantify the statistical 
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where M = ^zZij^ij is the total number of links in the 
network, c, is the community containing node i, m c = 
\ YjijAij\pi = c][cj = c] is the total number of links inside 
community c, and K c = 2, £;[c, = c] is the total degree of all 
nodes in community c. The first definition of Q illustrates the 
intuition of its form: For every node pair that shares a commu- 
nity we sum the difference between whether or not that pair is 
actually linked with the expected "number" of links between 
those same two nodes if the system was a purely random net- 
work constrained to the same degree sequence (this null model 
is known as the configuration model, and the loss term is ap- 
proximate). This is then normalized by the total number of 
links in the network. By rewriting the sum over node pairs as 
a sum over the communities themselves, the second definition 
of Q makes clear the resolution limit: global changes to the to- 
tal number of links M will disproportionately affect each com- 
munity's local contribution to Q. This can potentially shift the 
maximal value of Q to a different partition even when the local 
structure of the communities remains unchanged. 

Equation (|2| gives values between - 1 and 1 . When Q ~ 0, 
there is strong evidence that the discovered community struc- 
ture is not significant, at least according to this null model, 
while the communities are considered better and more signif- 
icant as Q grows. In practice, researchers may assume that 
a network possesses modular structure when Q > 0.25 or 
0.3 (34). However, since fluctuations can induce high mod- 
ularity in random graphs, one must always approach the raw 
magnitude of Q with caution: statistical testing (Sec.[V]i may 
provide stronger evidence for the presence of modules than 
modularity alone [38]. 



C. Partition density 

Yet another approach to quantifying community structure is 
that of partition density [ 30 1 . Partition density was introduced 
specifically for the case of link communities, where links in- 
stead of nodes are partitioned into groups. This allows for 
communities to overlap since nodes may belong to multiple 
groups simultaneously. We do not consider overlapping com- 
munities here, but partition density can still be calculated for 
nonoverlapping node communities. 

The partition density D is 
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Partition density measures, for each community, the number 
of links within that community minus the minimum number 
of links necessary to keep a subgraph of that size connected, 
n c - 1, the size of its spanning tree. This is then normalized by 
the maximum and minimum number of links possible for that 
connected subgraph, ( "A and n c —l, respectively. The partition 
density is then the average of this quantity over the communi- 
ties, weighted by the fraction of links within each community. 
For a link partition that covers an entire connected network, 
we have £ c m c = M, but this does not necessarily hold for a 
node partition. 

A crucial feature of the partition density is that it explic- 
itly compares the link density of a subgraph to that of a tree 
of the corresponding size. This controls for the fact that the 
subgraph in question is connected, making the reasonable as- 
sumption that communities should be internally connected. 
The null model used by modularity, on the other hand, does 
not make this assumption, and it may potentially assign very 
low probabilities to such an event. As we will show, this is a 
crucial aspect of modularity. 



III. COMMUNITIES IN TREES AND TREELIKE GRAPHS 

We now study a model tree graph that one may consider 
to not possess modular structure and show that these graphs 
possess partitions with arbitrarily high modularity values. We 
also study a mixed case graph containing both modular and 
non-modular structures. 



A. Cayley tree 

The Cayley tree is a regular graph with no loops and where 
every node i has the same degree ki = z + 1 (except for leaf 
nodes on the boundary which possess k = 1). See Fig. [T] It 
can be constructed by first starting from a root node at genera- 
tion 0, giving that node z + 1 child nodes, and then repeatedly 
giving each new child z children of its own. This continues for 
a fixed number of generations g. These trees can grow either 
in "width" (via z) or in "depth" (via g). The number of nodes 
in generation g > is n(g) — (z + l)z* , and the total number 
of nodes is N(g) = 1 + "(<?')■ Since this is a tree, the total 




FIG. 1. Cayley tree for z 
in white. 



2 and g = 3. The root node is indicated 



number of links is M(g) = N(g) - 1 = (z + 1)(1 - z s )/(l - z). 
Since the bulk of the graph is regular, the Cayley tree has no 
density fluctuations (all connected subgraphs of the same size 
have the same number of links), and so it does not in an obvi- 
ous way conform to our preconceived notions of communities 
as internally dense, externally sparse groups. In the thermody- 
namic limit the Cayley tree is known as the Bethe lattice. We 
concern ourselves here primarily with finite graphs, however, 
such that finite size and edge effects cannot be ignored. 

We now compute the modularity of a specific partition of 
the Cayley tree, which we call the analytic partition. First 
place the root node into a community of its own. Then create 
a new community for each child of the root node, containing 
that child and all of its descendants. Thus there are z + 2 
communities in total. Apart from the singleton community 
containing the root node, every community is a complete z-ary 
tree (which is not exactly a Cayley tree) with g— 1 generations. 
Partitioning the tree in this way requires cutting only z + 1 
links. There are zero links inside the singleton community 
and 
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links inside the z + 1 other communities. To compute the total 
degree of nodes within the community, we note that all [N(g)— 
l]/(z + 1) nodes have degree z + 1 except the n(g)/{z +1) 
boundary nodes that have degree 1 . Thus the total degree is 

*= ( z + i)(^-^4W nfe) 
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The final modularity is then given by substituting these ex- 
pressions for m, K, and M into: 
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where the functional dependence on z and g has been sup- 
pressed. For z — 10 and g — 4, for example, Qcayiey ~ 0.91, 
an extremely high modularity. Even for z — 3 and g — 3 we 
have a high modularity of 2 C ayiey ~ 0.7. (Raw modularity val- 
ues must be approached with caution; we will quantify these 
numbers in Sec.|V]) In general, the limiting value of (2cayiey 
for a given z is 
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Even for a finite g > 1, Qcayiey — > 1 as z — » oo. Thus the 
Cayley tree is able to achieve arbitrarily high modularity 
partitions. (We will later show these partitions to also be sta- 
tistically significant.) This is not the only partition capable of 
achieving high modularity. We discuss another partition in the 
Appendix. 

Meanwhile, the z + 1 branch communities of the Cayley 
tree's analytic partition each have conductance 
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For z = 4 and g = 10, for example, 
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very small value. This makes sense since only a single link 
separates that entire branch from the rest of the graph. This 
also emphasizes that conductance is primarily a measure of 
bottlenecks and isolation and should be approached with cau- 
tion when applied to community structure. 

Finally, we remark that the partition density of the Cayley 
tree is zero since m c = n c — 1 . This is true not just for the 
analytic partition but for all partitions of the Cayley tree where 
each community is connected. 



B. A clique and a tree 

In practice, one may deal with networks with wide fluc- 
tuations in local density, meaning there may exist localized 
subgraphs of low and of high density at the same time. We an- 
alyze a simple example consisting of a single complete graph 
known as a clique connected by one link to the root of a z-ary 
tree of g generations. See Fig. [2] 

We wish to compute the modularity of a community parti- 
tion containing the entire clique in one community and the 
entire tree in the other, where only the link from the root 
of the tree to the clique was cut. We assume that there 
are n c ii que nodes in the clique and n tree = (l - / (1 - z) 
nodes in the tree. The numbers of links in each subgraph 
are m cUque = ("* que ) and m tree = z(l -z s )/(l - z), respec- 



tively. The total number of links is M = m, 



clique 



+ 1, 



and the total degrees are K c i ique = n c i ique + (nciique - l) 2 and 
K tlee = z g + (z + 1) (1 - Z 8 ) I (1 - z). The final modularity of 
the partition is then given by substituting these expressions 
into 
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FIG. 2. (a) A mixed test case consisting of a single clique (complete 
subgraph) of n e u que nodes connected by a single link to a z-ary tree. 
This is partitioned into communities by cutting the single bridging 
link, (b) A generalization where now T trees are connected to the 
single clique (T < n dique ). 



We plot Eq. Q as a function of g in Fig. |3ja) for n c ii que = 
100 and several values of z. We see that Q attains a maximum 
of 1/2, a value not as high as the pure Cayley tree previously 
analyzed despite the addition of a "perfect" community. We 
also see that, as z increases, Q becomes more sharply peaked 
as a function of g. This is due to the resolution limit: the larger 
z is, the more quickly the tree will grow from one generation 
to the next, and thus the tree community more quickly passes 
beyond the size preferred by modularity. This leads to a Q 
that grows more rapidly and then decays more rapidly as g 
increases. 

We also study a generalization of Fig.[2|a) from one tree to 
T trees [Fig.|2jb)], where each tree is its own community. For 
this model ;« tlee and K tlee are unchanged for each tree, while 

nOW Clique = T + n c li que ("clique - 1), M = m c i ique + Tm lree + T, 

and 
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We plot Eq. ( fT0] i in Fig. [^b) as a function of n c iique for sev- 
eral values of T. We see that increasing T raises the overall 
modularity of the partition, giving apparently high values of 
Gciique-trees > 0.8. We also see that, as T increases, the curve 
becomes more flat, meaning that good quality partitions, ac- 
cording to modularity, exist for a wide range of clique sizes. 
We remark that this generalization may also be treated by ex- 
ploiting the recursive nature of z-ary trees by merging all the 
tree roots into one node and moving that node into the clique 
(this is particularly simple when T - z). 

In Sec. [V] we study the statistical significance of these 
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(a) 0.5 
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FIG. 3. (a) Modularity for the example network illustrated in 
Fig.[2ja). As we increase the size of the tree for a fixed clique size, 
the modularity grows to a maximum value and then decays away. 
This is due to the resolution limit: there exists a specific tree size 
that maximizes Q for each clique size, (b) For the generalization of 
one clique and T trees, shown in Fig. |2jb), we see that the analytic 
partition again attains high modularity, especially as more trees are 
added. Likewise, as T increases we see that the peak of Q flattens out 
and that the partition has high modularity for a range of clique sizes. 
This means that much of the resolution limit can be compensated for 
if the network is sufficiently treelike. 



clique-tree partitions. 



C. Other trees 

The results above were derived for Cayley and z-ary trees. 
The regular nature of these trees allows for tractable expres- 
sions of modularity, but our results are not limited to these 
types of trees. The important feature in this context is that all 
connected subgraphs of n nodes in any tree will always con- 
tain m = re— 1 links. Since it seems a reasonable basic require- 
ment for a community detection method to discover commu- 
nities that are connected, this density relation is a reasonable 
minimum baseline for a method to be compared against. This 
also means that, since every tree obeys this relation, bottle- 
necks become the primary drivers of high modularity parti- 
tions in all trees. We further explore the generality of our 



results in Sec. IV where we apply community detection algo- 
rithms to random trees. 



IV. REAL- WORLD EXAMPLES 

The above derivations show that trees may possess arbitrar- 
ily high values of modularity. However, these calculations did 
not consider the resolution limit of modularity. In fact, real- 
world optimization of modularity will result in partitions that 
give even higher values of Q than those of the analytic parti- 
tions discussed in Sec. Hill 

To see this we apply two of the most popular and success- 
ful community discovery methods. The first is known as fast 
unfolding (sometimes referred to as the Louvain method) and 
can efficiently find very high modularity partitions B2ll . The 
second method is called infomap ||43l . Infomap does not opti- 
mize modularity, instead exploiting information-theoretic ar- 
guments, but the partitions it does find are often high in mod- 
ularity, especially for undirected networks. 

We apply these algorithms to the Cayley tree. In Fig.|4]we 
plot the modularities discovered by each algorithm and the 
modularities Qcayiey °f the analytic partitions [Eq. i5V]. We 
see that the methods find communities that appear as strong 
as the analytic method or stronger. Fast unfolding typically 
exceeds Qcayiey as the trees grow, and even approaches Q = I. 
Infomap tends to stay closer to Q cay \ ey , but it too can exceed 
these bounds, especially for trees with z — 2. If these methods 
were applied blindly to a network, such high values of mod- 
ularity would suggest that these communities are extremely 
high quality and that the network was extremely modular. 

What about trees other than the Cayley tree? Will such dis- 
covery methods find comparable values of modularity? To an- 
swer this, we now apply these methods to random trees gen- 
erated from a Gal ton- Watson branching process [44|, where 
each node has a random number of descendants drawn from 
a Poisson distribution with mean A. (We also stop grow- 
ing the tree at g generations.) For A — 4 and g = 6, for 
example, we find fast unfolding partitions with modularity 
Q = 0.9814 + 0.0055, while for infomap we find partitions 
with Q = 0.8594 ± 0.0069. This supports the generality of our 
results: high modularity partitions also exist in non-Cayley 
trees. 

As a final practical example, we also apply both methods to 
a treelike network derived from a genealogical dataset captur- 
ing the advisor-advisee relationships between mathematicians 
and their students l45l [461 . (This genealogy is not exactly 
a tree since some students have multiple advisors.) We only 
consider the giant connected component of the network, cap- 
turing approximately 90% of the dataset. In total the network 
has N = 133319 nodes and M = 148247 links. The modular- 
ities of the partitions found by fast unfolding and infomap are 
Qfu = 0.951083 and Q m = 0.877146, respectively. These 
high values would again imply that the network is strongly 
modular; however, statistical testing should be performed to 
support this argument (see Sec.[V|. 

As a brief aside, another interesting aspect of a commu- 
nity partition is the distribution of community sizes (num- 
bers of nodes per community). Since any discovered mod- 
ular network structure depends intrinsically on the definition 
at the heart of the algorithm used to find that structure, it is 
not known for certain what the true distribution is. Never- 
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Tree generation, g Community size, s 



FIG. 4. (Color online) Community discovery methods find even 
higher values of modularity than the analytical partition of the Cay- 
ley tree. We apply two methods: (a) fast unfolding |42| and (b) 
infomap [43 1 for several values of z. Solid symbols correspond to 
Eq. l|6j while open symbols correspond to the modularities found by 
the methods. Fast unfolding finds consistently higher modularity par- 
titions than the analytic partition, due to the resolution limit. These 
partitions even approach Q = 1. Infomap, which does not optimize 
modularity, tends to find partitions comparable to the analytic parti- 
tion, although it too finds higher value partitions for z = 2 and some 
values of g for z = 3. Note that the vertical axes do not begin at 

2 = 0. 



theless, there has been empirical evidence showing that the 
size distribution may exhibit a power law Pr(s) ~ s~ a , for 

a > i GUEo). 

Yet the distributions of community sizes found in the ge- 
nealogical network, shown in Fig. [5] are not heavy -tailed. In- 
stead both methods find approximately exponential distribu- 
tions, with a small number of larger communities that would 
be underrepresented by an exponential distribution. The lack 
of very large communities may be expected in graphs with- 
out hubs, but the degree distribution for this network is heavy 
tailed [Fig. |5jb), inset]. This relatively narrow size distribu- 
tion may provide some warning that the communities found in 
this network differ from typical communities in some mean- 
ingful way, though this is far from certain. Further study of 
this distribution may prove fruitful in understanding the mod- 
ular nature of complex systems. 



FIG. 5. (Color online) The distribution of community sizes found in 
the genealogical network for (a) fast unfolding and (b) infomap. We 
see that neither distribution is heavy tailed, being instead approxi- 
mately exponential (straight lines) except for a small number of the 
largest communities that would be underestimated by an exponential 
distribution. The inset shows that unlike the community size distri- 
bution, the degree distribution of the network is heavy tailed. The 
straight line shows a pure power law, Pr(k) ~ k~ 23 , for comparison. 



V. STATISTICAL TESTING 

Given that there exist high modularity partitions in both the 
Cayley tree and the mathematics genealogy, a crucial ques- 
tion becomes, are these partitions significant in some way, 
or could they be simply due to some random process? This 
is especially important since it is known that sparse, uncor- 
rected graphs can potentially possess high modularity parti- 
tions due to fluctuations 1 38 -40 1 . To address such questions 
of statistical significance requires first defining an appropriate 
null model. Hypothesis testing then asks what is the probabil- 
ity that the observed phenomena (in this case the discovered 
communities or their properties) may have arisen within the 
null model. If this probability is sufficiently low, then there is 
evidence that the communities cannot be explained by the null 
model. This does not mean that the communities are "mean- 
ingful," however, since this only compares them to that partic- 
ular null model. For example, a simple choice of null model 
is the configuration model: build uncorrelated random graphs 
that preserve the degree sequence of the original graph, ap- 
ply community detection to these graphs, and then compare 
the configuration model communities to those of the origi- 
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nal network. However, a Cayley tree is typically very differ- 
ent from its equivalent configuration model ensemble, being 
highly structurally ordered, and this alone may lead to statis- 
tically significant differences in, e.g., modularity. Thus it is 
crucial to choose the most appropriate null model possible. 

Defining statistical tests for community structure remains 
an area of research [ 47 48 1 . For our purposes, we use the test- 
ing procedure introduced in BTl . Roughly, this test takes the 
worst node w in a community c (the node with the least neigh- 
bors also inside c), removes w from c, and asks what is the 
probability p that w would have that many neighbors or more 
within c if its links were distributed randomly over the whole 
graph while holding the rest of c and the degrees of all other 
nodes fixed. Since w represents the worst case, if p is small, 
then it is unlikely that w or any other nodes in c would have 
so many other neighbors also in c due to chance. Therefore, 
they consider a community to be significant if p < 0.05, the 
standard significance level for hypothesis testing. Note that 
this test strictly controls for both the links inside c (and there- 
fore its density), the number of links exiting c, and the overall 
sparsity of the network. The authors in [47 1 used this test to 
show that communities in sparse Erdos-Renyi and power law 
graphs are not significant. For full details, see ll47l . 

Here, we compute a p for each community of interest using 
their method and ask what fraction of communities are signif- 
icant. 

For the analytic partitions of the Cayley tree (Sec. |III A| >, 
we find that all z + 1 branch communities are statistically sig- 
nificant. (For z = 3 and g = 7, for example, the maximum 
p = 0.00144.) This means that this particular partition can- 
not be explained simply by the global sparsity of the tree it- 
self. In particular, this test considers the internal structure of a 
community as fixed, and yet these communities are significant 
even though they are internally maximally sparse trees. This 
contradicts the typical intuition of communities as internally 
dense, externally sparse subgraphs and supports the argument 
that bottlenecks alone are sufficient to significantly optimize 
modularity. 

We next test the analytic partition of the clique-tree exam- 
ple (Sec. M . We consider for n dique = 100 all z = 2,3,4 
and g = 27777" , 10 [see also Fig. |3ja)]. In all cases both com- 
munities were significant: the maximum p observed from any 
combination of those parameters was p = 0.00387. 

We now turn to the community discovery methods fast un- 
folding and infomap. For fast unfolding on the Cayley tree 
(z = 3, g = 7) we find that approximately 91% of the dis- 
covered communities were significant. This shows that even 
practical methods can find statistically significant, high mod- 
ularity partitions solely through the discovery of bottlenecks. 
For infomap, which does not optimize modularity, we find that 
no communities are statistically significant according to this 
test. However, we remark that a less strict test also introduced 
in [47] shows that approximately 48% of the infomap com- 
munities are significant. The truth likely lies between these 
extremes, but we can safely conclude that most infomap com- 
munities could be explained by this test's null model. 

Next, we test fast unfolding and infomap on the finite 



ble results to the partitions of the Cayley tree: for A — 4 and 
g - 6, we find that 96.9% ± 0.808% fast unfolding commu- 
nities were significant, while almost no (0.031 1% ± 0.0350%) 
infomap communities were significant. 

Finally, we consider the practical example of the mathe- 
matics genealogy. Both Fast unfolding and infomap found 
apparently high values of modularity, but are these results sig- 
nificant? Applying this test shows that they are not: for Fast 
Unfolding and infomap only approximately 2.4% and 2.6% of 
the communities were significant, respectively. Although we 
again caution that this does not necessarily prove these com- 
munities to be meaningless, it further underlines the potential 
danger of relying upon raw modularity values as a quantifier 
of modular structure. 



VI. DISCUSSION AND CONCLUSIONS 

We have shown that trees appear very modular, yet con- 
nected trees are maximally sparse and possess no density fluc- 
tuations, going against the tenet that communities are unusu- 
ally dense subgraphs. Thus, counter to our intuition, measures 
such as modularity, while ostensibly rewarding densely inter- 
connected groups, can actually be optimized solely through 
the discovery of bottlenecks, and it is not necessary for the 
discovered groups to be internally dense. In particular, we do 
not claim that trees lack communities, nor do we claim that 
these communities are not meaningful in some way. Instead, 
we argue only that it is sufficient to discover bottlenecks to op- 
timize modularity and conductance. This disconnect between 
intuition and practice has not been well discussed in the liter- 
ature, and in fact most work has overlooked the outsized role 
that bottlenecks play in the existence of modular structure. 

So is our definition of modular structure correct? Equa- 
tion (|2} depends so strongly on its null model that we must 
judiciously understand all facets of it. We have shown that 
communities do not need a significantly high internal density 
to lead to high quality (according to modularity). Therefore, 
if researchers want to consider modules according to their 
intuition, they may need to introduce measures that specifi- 
cally account for internal density in some way beyond that of 
Eq. Taken together with modularity's other issues such as 
its resolution limit, it appears that rigorously and unambigu- 
ously quantifying modular network structure is difficult and 
remains an open question. 

Researchers have shown that sparse graphs will have high 
modularity, yet the statistical tests applied here show that the 
sparsity of trees alone is not sufficient to explain these results. 
By controlling for tree sparsity, we have shown that bottle- 
necks lead not only to high modularity but to statistically sig- 
nificantly high modularity. Our results on trees further differ 
from sparse random graphs in that the expected high modular- 
ity partitions do not need to be equipartitions (see Appendix), 
and the derivations here do not invoke features of ensembles 
of random graphs. 

One may suspect that the addition of nontree components 
to a network may destroy the observed phenomena, that it is 



Galton- Watson trees discussed in Sec. IV We find compara- somehow fragile, yet our results in Sec. Ill B show that this is 
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not the case and that merely the presence of trees may lead to 
modular structure. A crucial consequence of this is that, since 
trees are the limiting structure as networks become sparse, 
sampled and missing data [49] may boost modularity, at least 
in some regions of the network, even though the network re- 
mains globally connected. Incomplete data remain an issue 
in high-throughput biological assays, for example lf50l , and 
thus one should consider both sparsity and bottlenecks when 
approaching graph partitioning in these problems. 

Finally, the statistical tests we used in Sec. [V] show that 
many of the communities found in trees are significant, 
whereas the communities found in the mathematics geneal- 
ogy, while very high in modularity, are typically not signifi- 
cant. However, this test does not verify that the tree commu- 
nities are "meaningful," only that they differ from the test's 
null model. Likewise, the discovered genealogical communi- 
ties could still be meaningful in other ways, perhaps reveal- 
ing important schools of mathematicians or mathematics re- 
search. For networks that possess additional data annotating 
the properties or roles of network elements, for example gene 
ontology terms describing proteins in protein-protein interac- 
tion networks [50 -53 1, these discovered groups may in fact be 
highly enriched, meaning that their nodes or links share many 
annotations l30ll54l . even though structurally the community 
is not significant. Further study of the interplay between these 
different validation mechanisms may be crucial to increasing 
our understanding of modular networks and complex systems 
in general. 



branch communities. Neglecting the singleton community, 
which has a vanishing contribution to Q anyway, the com- 
munities are all the same size. We showed in Sec. Hill that 
the modularity of this partition Qcayiey can become arbitrarily 
close to 1. This is not the only arbitrarily high modularity 
partition present in the Cayley tree. 

To see this, take one of the z + 1 branch communities and 
"shatter" it such that all nodes in that branch now form sin- 
gleton communities of their own. The modularity ^shattered of 
this partition is 



Q 



shattered 



m 
M 



■(-) 
\2Ml 



N(g) - 1 n(g) 



z+l 

n(g) I 1 
z + 1 \2M 



z+l 



+ 1 



z + V 
2M , 



(A.l) 



Here the first term is the contribution of the remaining z "un- 
shattered" branches, the second term accounts for the losses 
due to singleton interior nodes, both the root node and the 
shattered branch interior, and the last term accounts for losses 
due to the singleton leaf or interface nodes of the shattered 
branch. The quantities m, M, K, n(g), and N(g) correspond to 



those derived in Sec. Ill A Substituting these into Eq. (TAT 



gives 
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Appendix: Another high modularity partition of the Cayley tree 

Consider the analytic partition of the Cayley tree, where the 
root node occupies a singleton community alongside the z+l 



As expected, this value is smaller than the limit Qcayiey — > 
z/(z+ 1) as g — > oo but it shows that this partition still achieves 
arbitrarily high values of modularity. 

As mentioned in the main text, it has been shown that 
sparse, uncorrected random graphs may possess high mod- 
ularity partitions 11381 [391 . Under these conditions all com- 
munities should be equivalent in expectation and thus one ex- 
pects all communities to be roughly comparable in size, so 
that the community structure must be an equipartition of the 
network. The high value of modularity displayed in Eq. \A.2\ 
shows that trees, while also being sparse, can contain high 
modularity partitions that are very far from equipartitions, in 
contrast to the results of lF38ll39l . 
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